Text Mining for Drugs and Chemical Compounds: Methods, Tools and Applications
Identifieur interne : 000411 ( Main/Exploration ); précédent : 000410; suivant : 000412Text Mining for Drugs and Chemical Compounds: Methods, Tools and Applications
Auteurs : Miguel Vazquez [Espagne] ; Martin Krallinger [Espagne] ; Florian Leitner [Espagne] ; Alfonso Valencia [Espagne]Source :
- Molecular Informatics [ 1868-1743 ] ; 2011-06.
English descriptors
Abstract
Providing prior knowledge about biological properties of chemicals, such as kinetic values, protein targets, or toxic effects, can facilitate many aspects of drug development. Chemical information is rapidly accumulating in all sorts of free text documents like patents, industry reports, or scientific articles, which has motivated the development of specifically tailored text mining applications. Despite the potential gains, chemical text mining still faces significant challenges. One of the most salient is the recognition of chemical entities mentioned in text. To help practitioners contribute to this area, a good portion of this review is devoted to this issue, and presents the basic concepts and principles underlying the main strategies. The technical details are introduced and accompanied by relevant bibliographic references. Other tasks discussed are retrieving relevant articles, identifying relationships between chemicals and other entities, or determining the chemical structures of chemicals mentioned in text. This review also introduces a number of published applications that can be used to build pipelines in topics like drug side effects, toxicity, and protein‐disease‐compound network analysis. We conclude the review with an outlook on how we expect the field to evolve, discussing its possibilities and its current limitations.
Url:
DOI: 10.1002/minf.201100005
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 001E38
- to stream Istex, to step Curation: 001D14
- to stream Istex, to step Checkpoint: 000067
- to stream Main, to step Merge: 000416
- to stream Main, to step Curation: 000411
Le document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Text Mining for Drugs and Chemical Compounds: Methods, Tools and Applications</title>
<author><name sortKey="Vazquez, Miguel" sort="Vazquez, Miguel" uniqKey="Vazquez M" first="Miguel" last="Vazquez">Miguel Vazquez</name>
</author>
<author><name sortKey="Krallinger, Martin" sort="Krallinger, Martin" uniqKey="Krallinger M" first="Martin" last="Krallinger">Martin Krallinger</name>
</author>
<author><name sortKey="Leitner, Florian" sort="Leitner, Florian" uniqKey="Leitner F" first="Florian" last="Leitner">Florian Leitner</name>
</author>
<author><name sortKey="Valencia, Alfonso" sort="Valencia, Alfonso" uniqKey="Valencia A" first="Alfonso" last="Valencia">Alfonso Valencia</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:5746E755E16A7F927A9F1FD6D90E9C34418EC093</idno>
<date when="2011" year="2011">2011</date>
<idno type="doi">10.1002/minf.201100005</idno>
<idno type="url">https://api.istex.fr/document/5746E755E16A7F927A9F1FD6D90E9C34418EC093/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001E38</idno>
<idno type="wicri:Area/Istex/Curation">001D14</idno>
<idno type="wicri:Area/Istex/Checkpoint">000067</idno>
<idno type="wicri:doubleKey">1868-1743:2011:Vazquez M:text:mining:for</idno>
<idno type="wicri:Area/Main/Merge">000416</idno>
<idno type="wicri:Area/Main/Curation">000411</idno>
<idno type="wicri:Area/Main/Exploration">000411</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Text Mining for Drugs and Chemical Compounds: Methods, Tools and Applications</title>
<author><name sortKey="Vazquez, Miguel" sort="Vazquez, Miguel" uniqKey="Vazquez M" first="Miguel" last="Vazquez">Miguel Vazquez</name>
<affiliation wicri:level="3"><country xml:lang="fr">Espagne</country>
<wicri:regionArea>Centro Nacional de Investigaciones Oncológicas, Biología Computacional y Estructural, Madrid</wicri:regionArea>
<placeName><settlement type="city">Madrid</settlement>
<region nuts="2" type="region">Communauté de Madrid</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Krallinger, Martin" sort="Krallinger, Martin" uniqKey="Krallinger M" first="Martin" last="Krallinger">Martin Krallinger</name>
<affiliation wicri:level="3"><country xml:lang="fr">Espagne</country>
<wicri:regionArea>Centro Nacional de Investigaciones Oncológicas, Biología Computacional y Estructural, Madrid</wicri:regionArea>
<placeName><settlement type="city">Madrid</settlement>
<region nuts="2" type="region">Communauté de Madrid</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Leitner, Florian" sort="Leitner, Florian" uniqKey="Leitner F" first="Florian" last="Leitner">Florian Leitner</name>
<affiliation wicri:level="3"><country xml:lang="fr">Espagne</country>
<wicri:regionArea>Centro Nacional de Investigaciones Oncológicas, Biología Computacional y Estructural, Madrid</wicri:regionArea>
<placeName><settlement type="city">Madrid</settlement>
<region nuts="2" type="region">Communauté de Madrid</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Valencia, Alfonso" sort="Valencia, Alfonso" uniqKey="Valencia A" first="Alfonso" last="Valencia">Alfonso Valencia</name>
<affiliation wicri:level="3"><country xml:lang="fr">Espagne</country>
<wicri:regionArea>Centro Nacional de Investigaciones Oncológicas, Biología Computacional y Estructural, Madrid</wicri:regionArea>
<placeName><settlement type="city">Madrid</settlement>
<region nuts="2" type="region">Communauté de Madrid</region>
</placeName>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="j">Molecular Informatics</title>
<title level="j" type="abbrev">Mol. Inf.</title>
<idno type="ISSN">1868-1743</idno>
<idno type="eISSN">1868-1751</idno>
<imprint><publisher>WILEY‐VCH Verlag</publisher>
<pubPlace>Weinheim</pubPlace>
<date type="published" when="2011-06">2011-06</date>
<biblScope unit="volume">30</biblScope>
<biblScope unit="issue">6‐7</biblScope>
<biblScope unit="page" from="506">506</biblScope>
<biblScope unit="page" to="519">519</biblScope>
</imprint>
<idno type="ISSN">1868-1743</idno>
</series>
<idno type="istex">5746E755E16A7F927A9F1FD6D90E9C34418EC093</idno>
<idno type="DOI">10.1002/minf.201100005</idno>
<idno type="ArticleID">MINF201100005</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">1868-1743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Chemical compounds</term>
<term>Drugs</term>
<term>Information extraction</term>
<term>Named entity recognition</term>
<term>Text mining</term>
</keywords>
</textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Providing prior knowledge about biological properties of chemicals, such as kinetic values, protein targets, or toxic effects, can facilitate many aspects of drug development. Chemical information is rapidly accumulating in all sorts of free text documents like patents, industry reports, or scientific articles, which has motivated the development of specifically tailored text mining applications. Despite the potential gains, chemical text mining still faces significant challenges. One of the most salient is the recognition of chemical entities mentioned in text. To help practitioners contribute to this area, a good portion of this review is devoted to this issue, and presents the basic concepts and principles underlying the main strategies. The technical details are introduced and accompanied by relevant bibliographic references. Other tasks discussed are retrieving relevant articles, identifying relationships between chemicals and other entities, or determining the chemical structures of chemicals mentioned in text. This review also introduces a number of published applications that can be used to build pipelines in topics like drug side effects, toxicity, and protein‐disease‐compound network analysis. We conclude the review with an outlook on how we expect the field to evolve, discussing its possibilities and its current limitations.</div>
</front>
</TEI>
<affiliations><list><country><li>Espagne</li>
</country>
<region><li>Communauté de Madrid</li>
</region>
<settlement><li>Madrid</li>
</settlement>
</list>
<tree><country name="Espagne"><region name="Communauté de Madrid"><name sortKey="Vazquez, Miguel" sort="Vazquez, Miguel" uniqKey="Vazquez M" first="Miguel" last="Vazquez">Miguel Vazquez</name>
</region>
<name sortKey="Krallinger, Martin" sort="Krallinger, Martin" uniqKey="Krallinger M" first="Martin" last="Krallinger">Martin Krallinger</name>
<name sortKey="Leitner, Florian" sort="Leitner, Florian" uniqKey="Leitner F" first="Florian" last="Leitner">Florian Leitner</name>
<name sortKey="Valencia, Alfonso" sort="Valencia, Alfonso" uniqKey="Valencia A" first="Alfonso" last="Valencia">Alfonso Valencia</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000411 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000411 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:5746E755E16A7F927A9F1FD6D90E9C34418EC093 |texte= Text Mining for Drugs and Chemical Compounds: Methods, Tools and Applications }}
This area was generated with Dilib version V0.6.32. |